Overview

Dataset statistics

Number of variables 32
Number of observations 1470
Missing cells 0
Missing cells (%) 0.0%
Duplicate rows 0
Duplicate rows (%) 0.0%
Total size in memory 367.6 KiB
Average record size in memory 256.1 B
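
The per-record figure above can be cross-checked from the other two numbers. The sketch below is only an arithmetic check on the values printed in this report, assuming 1 KiB = 1024 bytes; the variable names are illustrative and not part of pandas-profiling.

```python
# Figures copied from the "Dataset statistics" block above.
total_size_kib = 367.6
n_observations = 1470
n_variables = 32
missing_cells = 0

# Average record size: total bytes divided by the number of rows.
avg_record_bytes = total_size_kib * 1024 / n_observations
print(f"{avg_record_bytes:.1f} B per record")

# Missing cells (%): missing cells over all cells (rows x columns).
missing_pct = 100 * missing_cells / (n_observations * n_variables)
print(f"{missing_pct:.1f}% missing")
```

The first line printed should agree with the 256.1 B reported above.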

Variable types

NUM 19
CAT 10
BOOL 3

Reproduction

Analysis started 2022-02-27 09:11:47.585086
Analysis finished 2022-02-27 09:12:50.528868
Duration 1 minute and 2.94 seconds
Version pandas-profiling v2.7.1
Command line pandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configuration config.yaml
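
The reported duration follows directly from the two timestamps. A minimal sketch using only the Python standard library, with the timestamp format string assumed from how the values are printed above:

```python
from datetime import datetime

# Format assumed from the timestamps shown in the Reproduction block.
fmt = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2022-02-27 09:11:47.585086", fmt)
finished = datetime.strptime("2022-02-27 09:12:50.528868", fmt)

# Elapsed wall-clock time in seconds, then split into minutes and seconds.
elapsed = (finished - started).total_seconds()
minutes, seconds = divmod(elapsed, 60)
print(f"{int(minutes)} minute(s) and {seconds:.2f} seconds")
```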

Warnings

MonthlyIncome is highly correlated with JobLevel [High correlation]
JobLevel is highly correlated with MonthlyIncome [High correlation]
EmployeeNumber has unique values [Unique]
EducationField has 27 (1.8%) zeros [Zeros]
JobRole has 52 (3.5%) zeros [Zeros]
NumCompaniesWorked has 197 (13.4%) zeros [Zeros]
TrainingTimesLastYear has 54 (3.7%) zeros [Zeros]
YearsAtCompany has 44 (3.0%) zeros [Zeros]
YearsInCurrentRole has 244 (16.6%) zeros [Zeros]
YearsSinceLastPromotion has 581 (39.5%) zeros [Zeros]
YearsWithCurrManager has 263 (17.9%) zeros [Zeros]
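
The zero percentages in these messages are the zero counts divided by the 1470 observations. The sketch below recomputes them from the counts printed above; the dictionary and variable names are illustrative only. Whether a zero is meaningful depends on the column: for EducationField and JobRole the zeros may simply be category codes if those columns were label-encoded before profiling, which the report itself does not state.

```python
n_observations = 1470

# Zero counts copied from the messages above.
zero_counts = {
    "EducationField": 27,
    "JobRole": 52,
    "NumCompaniesWorked": 197,
    "TrainingTimesLastYear": 54,
    "YearsAtCompany": 44,
    "YearsInCurrentRole": 244,
    "YearsSinceLastPromotion": 581,
    "YearsWithCurrManager": 263,
}

# Share of zeros per column, as a percentage rounded to one decimal.
zero_pct = {
    name: round(100 * count / n_observations, 1)
    for name, count in zero_counts.items()
}

for name, pct in zero_pct.items():
    print(f"{name}: {pct}%")
```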

Variables

Age
Real number (ℝ≥0)

Distinct count 43
Unique (%) 2.9%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Mean 36.923809523809524
Minimum 18
Maximum 60
Zeros 0
Zeros (%) 0.0%
Memory size 11.6 KiB

Quantile statistics

Minimum 18
5-th percentile 24
Q1 30
Median 36
Q3 43
95-th percentile 54
Maximum 60
Range 42
Interquartile range (IQR) 13

Descriptive statistics

Standard deviation 9.135373489
Coefficient of variation (CV) 0.2474114564
Kurtosis -0.4041451372
Mean 36.92380952
Median Absolute Deviation (MAD) 6
Skewness 0.4132863019
Sum 54278
Variance 83.45504879
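
Several of the Age figures above are derived from one another. The sketch below recomputes them from the printed values as a consistency check; it assumes the usual definitions (range = max - min, IQR = Q3 - Q1, CV = standard deviation / mean, variance = standard deviation squared) and uses illustrative variable names.

```python
# Values copied from the Age block above.
mean = 36.92380952
std = 9.135373489
minimum, q1, q3, maximum = 18, 30, 43, 60
total, n = 54278, 1470

value_range = maximum - minimum   # spread between the extremes
iqr = q3 - q1                     # spread of the middle half of the data
cv = std / mean                   # coefficient of variation
variance = std ** 2               # variance is the squared standard deviation
mean_from_sum = total / n         # mean recomputed from Sum and row count

print(value_range, iqr, round(cv, 4), round(variance, 3), round(mean_from_sum, 4))
```

The same relationships hold for every numeric variable in this report.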

Histogram with fixed size bins (bins=10)

Common values

Value Count Frequency (%)
35 78 5.3%
34 77 5.2%
36 69 4.7%
31 69 4.7%
29 68 4.6%
32 61 4.1%
30 60 4.1%
38 58 3.9%
33 58 3.9%
40 57 3.9%
Other values (33) 815 55.4%

Minimum 5 values

Value Count Frequency (%)
18 8 0.5%
19 9 0.6%
20 11 0.7%
21 13 0.9%
22 16 1.1%

Maximum 5 values

Value Count Frequency (%)
60 5 0.3%
59 10 0.7%
58 14 1.0%
57 4 0.3%
56 14 1.0%

Attrition
Boolean

Distinct count 2
Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory size 11.6 KiB

Value Count Frequency (%)
0 1233 83.9%
1 237 16.1%

BusinessTravel
Categorical

Distinct count 3
Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Memory size 11.6 KiB

Value Count Frequency (%)
2 1043 71.0%
1 277 18.8%
0 150 10.2%

Length

Max length 1
Mean length 1
Min length 1

Value Count Frequency (%)
Decimal_Number 3 100.0%

Value Count Frequency (%)
Common 3 100.0%

Value Count Frequency (%)
ASCII 3 100.0%

DailyRate
Real number (ℝ≥0)

Distinct count 886
Unique (%) 60.3%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Mean 802.4857142857143
Minimum 102
Maximum 1499
Zeros 0
Zeros (%) 0.0%
Memory size 11.6 KiB

Quantile statistics

Minimum 102
5-th percentile 165.35
Q1 465
Median 802
Q3 1157
95-th percentile 1424.1
Maximum 1499
Range 1397
Interquartile range (IQR) 692

Descriptive statistics

Standard deviation 403.5090999
Coefficient of variation (CV) 0.5028240288
Kurtosis -1.203822808
Mean 802.4857143
Median Absolute Deviation (MAD) 344
Skewness -0.003518568352
Sum 1179654
Variance 162819.5937

Histogram with fixed size bins (bins=10)

Common values

Value Count Frequency (%)
691 6 0.4%
1082 5 0.3%
408 5 0.3%
329 5 0.3%
530 5 0.3%
1329 5 0.3%
427 4 0.3%
1469 4 0.3%
147 4 0.3%
1225 4 0.3%
Other values (876) 1423 96.8%

Minimum 5 values

Value Count Frequency (%)
102 1 0.1%
103 1 0.1%
104 1 0.1%
105 1 0.1%
106 1 0.1%

Maximum 5 values

Value Count Frequency (%)
1499 1 0.1%
1498 1 0.1%
1496 2 0.1%
1495 3 0.2%
1492 1 0.1%

Department
Categorical

Distinct count 3
Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Memory size 11.6 KiB

Value Count Frequency (%)
2 961 65.4%
1 446 30.3%
0 63 4.3%

Length

Max length 1
Mean length 1
Min length 1

Value Count Frequency (%)
Decimal_Number 3 100.0%

Value Count Frequency (%)
Common 3 100.0%

Value Count Frequency (%)
ASCII 3 100.0%

DistanceFromHome
Real number (ℝ≥0)

Distinct count 29
Unique (%) 2.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Mean 9.19251700680272
Minimum 1
Maximum 29
Zeros 0
Zeros (%) 0.0%
Memory size 11.6 KiB

Quantile statistics

Minimum 1
5-th percentile 1
Q1 2
Median 7
Q3 14
95-th percentile 26
Maximum 29
Range 28
Interquartile range (IQR) 12

Descriptive statistics

Standard deviation 8.106864436
Coefficient of variation (CV) 0.8818982254
Kurtosis -0.2248334049
Mean 9.192517007
Median Absolute Deviation (MAD) 5
Skewness 0.9581179957
Sum 13513
Variance 65.72125098

Histogram with fixed size bins (bins=10)

Common values

Value Count Frequency (%)
2 211 14.4%
1 208 14.1%
10 86 5.9%
9 85 5.8%
3 84 5.7%
7 84 5.7%
8 80 5.4%
5 65 4.4%
4 64 4.4%
6 59 4.0%
Other values (19) 444 30.2%

Minimum 5 values

Value Count Frequency (%)
1 208 14.1%
2 211 14.4%
3 84 5.7%
4 64 4.4%
5 65 4.4%

Maximum 5 values

Value Count Frequency (%)
29 27 1.8%
28 23 1.6%
27 12 0.8%
26 25 1.7%
25 25 1.7%

Education
Real number (ℝ≥0)

Distinct count 5
Unique (%) 0.3%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Mean 2.912925170068027
Minimum 1
Maximum 5
Zeros 0
Zeros (%) 0.0%
Memory size 11.6 KiB

Quantile statistics

Minimum 1
5-th percentile 1
Q1 2
Median 3
Q3 4
95-th percentile 4
Maximum 5
Range 4
Interquartile range (IQR) 2

Descriptive statistics

Standard deviation 1.024164945
Coefficient of variation (CV) 0.3515932902
Kurtosis -0.5591149664
Mean 2.91292517
Median Absolute Deviation (MAD) 1
Skewness -0.289681082
Sum 4282
Variance 1.048913834

Histogram with fixed size bins (bins=10)

Common values

Value Count Frequency (%)
3 572 38.9%
4 398 27.1%
2 282 19.2%
1 170 11.6%
5 48 3.3%

Minimum 5 values

Value Count Frequency (%)
1 170 11.6%
2 282 19.2%
3 572 38.9%
4 398 27.1%
5 48 3.3%

Maximum 5 values

Value Count Frequency (%)
5 48 3.3%
4 398 27.1%
3 572 38.9%
2 282 19.2%
1 170 11.6%

EducationField
Real number (ℝ≥0)

ZEROS
Distinct count 6
Unique (%) 0.4%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Mean 3.8836734693877553
Minimum 0
Maximum 5
Zeros 27
Zeros (%) 1.8%
Memory size 11.6 KiB

Quantile statistics

Minimum 0
5-th percentile 1
Q1 3
Median 4
Q3 5
95-th percentile 5
Maximum 5
Range 5
Interquartile range (IQR) 2

Descriptive statistics

Standard deviation 1.289616109
Coefficient of variation (CV) 0.3320609005
Kurtosis 0.580226409
Mean 3.883673469
Median Absolute Deviation (MAD) 1
Skewness -1.174710388
Sum 5709
Variance 1.66310971

Histogram with fixed size bins (bins=10)

Common values

Value Count Frequency (%)
5 606 41.2%
4 464 31.6%
3 159 10.8%
2 132 9.0%
1 82 5.6%
0 27 1.8%

Minimum 5 values

Value Count Frequency (%)
0 27 1.8%
1 82 5.6%
2 132 9.0%
3 159 10.8%
4 464 31.6%

Maximum 5 values

Value Count Frequency (%)
5 606 41.2%
4 464 31.6%
3 159 10.8%
2 132 9.0%
1 82 5.6%

EmployeeNumber
Real number (ℝ≥0)

UNIQUE
Distinct count 1470
Unique (%) 100.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Mean 1024.865306122449
Minimum 1
Maximum 2068
Zeros 0
Zeros (%) 0.0%
Memory size 11.6 KiB

Quantile statistics

Minimum 1
5-th percentile 96.45
Q1 491.25
Median 1020.5
Q3 1555.75
95-th percentile 1967.55
Maximum 2068
Range 2067
Interquartile range (IQR) 1064.5

Descriptive statistics

Standard deviation 602.0243348
Coefficient of variation (CV) 0.5874180063
Kurtosis -1.223178906
Mean 1024.865306
Median Absolute Deviation (MAD) 533.5
Skewness 0.01657401958
Sum 1506552
Variance 362433.2997

Histogram with fixed size bins (bins=10)

Common values

Value Count Frequency (%)
2048 1 0.1%
1368 1 0.1%
1364 1 0.1%
1363 1 0.1%
1362 1 0.1%
1361 1 0.1%
1360 1 0.1%
1358 1 0.1%
1356 1 0.1%
1355 1 0.1%
Other values (1460) 1460 99.3%

Minimum 5 values

Value Count Frequency (%)
1 1 0.1%
2 1 0.1%
4 1 0.1%
5 1 0.1%
7 1 0.1%

Maximum 5 values

Value Count Frequency (%)
2068 1 0.1%
2065 1 0.1%
2064 1 0.1%
2062 1 0.1%
2061 1 0.1%

EnvironmentSatisfaction
Categorical

Distinct count 4
Unique (%) 0.3%
Missing 0
Missing (%) 0.0%
Memory size 11.6 KiB

Value Count Frequency (%)
3 453 30.8%
4 446 30.3%
2 287 19.5%
1 284 19.3%

Length

Max length 1
Mean length 1
Min length 1

Value Count Frequency (%)
Decimal_Number 4 100.0%

Value Count Frequency (%)
Common 4 100.0%

Value Count Frequency (%)
ASCII 4 100.0%

Gender
Boolean

Distinct count 2
Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory size 11.6 KiB

Value Count Frequency (%)
1 882 60.0%
0 588 40.0%

HourlyRate
Real number (ℝ≥0)

Distinct count 71
Unique (%) 4.8%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Mean 65.89115646258503
Minimum 30
Maximum 100
Zeros 0
Zeros (%) 0.0%
Memory size 11.6 KiB

Quantile statistics

Minimum 30
5-th percentile 33
Q1 48
Median 66
Q3 83.75
95-th percentile 97
Maximum 100
Range 70
Interquartile range (IQR) 35.75

Descriptive statistics

Standard deviation 20.32942759
Coefficient of variation (CV) 0.3085304415
Kurtosis -1.196398456
Mean 65.89115646
Median Absolute Deviation (MAD) 18
Skewness -0.0323109529
Sum 96860
Variance 413.2856263

Histogram with fixed size bins (bins=10)

Common values

Value Count Frequency (%)
66 29 2.0%
42 28 1.9%
98 28 1.9%
84 28 1.9%
48 28 1.9%
96 27 1.8%
79 27 1.8%
57 27 1.8%
87 26 1.8%
56 26 1.8%
Other values (61) 1196 81.4%

Minimum 5 values

Value Count Frequency (%)
30 19 1.3%
31 15 1.0%
32 24 1.6%
33 19 1.3%
34 12 0.8%

Maximum 5 values

Value Count Frequency (%)
100 19 1.3%
99 20 1.4%
98 28 1.9%
97 21 1.4%
96 27 1.8%

JobInvolvement
Categorical

Distinct count 4
Unique (%) 0.3%
Missing 0
Missing (%) 0.0%
Memory size 11.6 KiB

Value Count Frequency (%)
3 868 59.0%
2 375 25.5%
4 144 9.8%
1 83 5.6%

Length

Max length 1
Mean length 1
Min length 1

Value Count Frequency (%)
Decimal_Number 4 100.0%

Value Count Frequency (%)
Common 4 100.0%

Value Count Frequency (%)
ASCII 4 100.0%

JobLevel
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count 5
Unique (%) 0.3%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Mean 2.0639455782312925
Minimum 1
Maximum 5
Zeros 0
Zeros (%) 0.0%
Memory size 11.6 KiB

Quantile statistics

Minimum 1
5-th percentile 1
Q1 1
Median 2
Q3 3
95-th percentile 4
Maximum 5
Range 4
Interquartile range (IQR) 2

Descriptive statistics

Standard deviation 1.106939899
Coefficient of variation (CV) 0.5363222319
Kurtosis 0.3991520554
Mean 2.063945578
Median Absolute Deviation (MAD) 1
Skewness 1.025401283
Sum 3034
Variance 1.22531594

Histogram with fixed size bins (bins=10)

Common values

Value Count Frequency (%)
1 543 36.9%
2 534 36.3%
3 218 14.8%
4 106 7.2%
5 69 4.7%

Minimum 5 values

Value Count Frequency (%)
1 543 36.9%
2 534 36.3%
3 218 14.8%
4 106 7.2%
5 69 4.7%

Maximum 5 values

Value Count Frequency (%)
5 69 4.7%
4 106 7.2%
3 218 14.8%
2 534 36.3%
1 543 36.9%

JobRole
Real number (ℝ≥0)

ZEROS
Distinct count 9
Unique (%) 0.6%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Mean 5.446938775510204
Minimum 0
Maximum 8
Zeros 52
Zeros (%) 3.5%
Memory size 11.6 KiB

Quantile statistics

Minimum 0
5-th percentile 1
Q1 4
Median 6
Q3 7
95-th percentile 8
Maximum 8
Range 8
Interquartile range (IQR) 3

Descriptive statistics

Standard deviation 2.323901502
Coefficient of variation (CV) 0.4266435879
Kurtosis -0.4613439842
Mean 5.446938776
Median Absolute Deviation (MAD) 2
Skewness -0.7701490135
Sum 8007
Variance 5.400518192

Histogram with fixed size bins (bins=10)

Common values

Value Count Frequency (%)
8 326 22.2%
7 292 19.9%
6 259 17.6%
5 145 9.9%
4 131 8.9%
3 102 6.9%
2 83 5.6%
1 80 5.4%
0 52 3.5%

Minimum 5 values

Value Count Frequency (%)
0 52 3.5%
1 80 5.4%
2 83 5.6%
3 102 6.9%
4 131 8.9%

Maximum 5 values

Value Count Frequency (%)
8 326 22.2%
7 292 19.9%
6 259 17.6%
5 145 9.9%
4 131 8.9%

JobSatisfaction
Categorical

Distinct count 4
Unique (%) 0.3%
Missing 0
Missing (%) 0.0%
Memory size 11.6 KiB

Value Count Frequency (%)
4 459 31.2%
3 442 30.1%
1 289 19.7%
2 280 19.0%

Length

Max length 1
Mean length 1
Min length 1

Value Count Frequency (%)
Decimal_Number 4 100.0%

Value Count Frequency (%)
Common 4 100.0%

Value Count Frequency (%)
ASCII 4 100.0%

MaritalStatus
Categorical

Distinct count 3
Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Memory size 11.6 KiB

Value Count Frequency (%)
2 673 45.8%
1 470 32.0%
0 327 22.2%

Length

Max length 1
Mean length 1
Min length 1

Value Count Frequency (%)
Decimal_Number 3 100.0%

Value Count Frequency (%)
Common 3 100.0%

Value Count Frequency (%)
ASCII 3 100.0%

MonthlyIncome
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count 1349
Unique (%) 91.8%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Mean 6502.931292517007
Minimum 1009
Maximum 19999
Zeros 0
Zeros (%) 0.0%
Memory size 11.6 KiB

Quantile statistics

Minimum 1009
5-th percentile 2097.9
Q1 2911
Median 4919
Q3 8379
95-th percentile 17821.35
Maximum 19999
Range 18990
Interquartile range (IQR) 5468

Descriptive statistics

Standard deviation 4707.956783
Coefficient of variation (CV) 0.7239745541
Kurtosis 1.005232691
Mean 6502.931293
Median Absolute Deviation (MAD) 2199
Skewness 1.369816681
Sum 9559309
Variance 22164857.07

Histogram with fixed size bins (bins=10)

Common values

Value Count Frequency (%)
2342 4 0.3%
6142 3 0.2%
2610 3 0.2%
2559 3 0.2%
6347 3 0.2%
2404 3 0.2%
3452 3 0.2%
5562 3 0.2%
2451 3 0.2%
2741 3 0.2%
Other values (1339) 1439 97.9%

Minimum 5 values

Value Count Frequency (%)
1009 1 0.1%
1051 1 0.1%
1052 1 0.1%
1081 1 0.1%
1091 1 0.1%

Maximum 5 values

Value Count Frequency (%)
19999 1 0.1%
19973 1 0.1%
19943 1 0.1%
19926 1 0.1%
19859 1 0.1%

MonthlyRate
Real number (ℝ≥0)

Distinct count 1427
Unique (%) 97.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Mean 14313.103401360544
Minimum 2094
Maximum 26999
Zeros 0
Zeros (%) 0.0%
Memory size 11.6 KiB

Quantile statistics

Minimum 2094
5-th percentile 3384.55
Q1 8047
Median 14235.5
Q3 20461.5
95-th percentile 25431.9
Maximum 26999
Range 24905
Interquartile range (IQR) 12414.5

Descriptive statistics

Standard deviation 7117.786044
Coefficient of variation (CV) 0.4972915967
Kurtosis -1.2149561
Mean 14313.1034
Median Absolute Deviation (MAD) 6206.5
Skewness 0.01857780789
Sum 21040262
Variance 50662878.17

Histogram with fixed size bins (bins=10)

Common values

Value Count Frequency (%)
4223 3 0.2%
9150 3 0.2%
9096 2 0.1%
13008 2 0.1%
12858 2 0.1%
6881 2 0.1%
23016 2 0.1%
8952 2 0.1%
20364 2 0.1%
22102 2 0.1%
Other values (1417) 1448 98.5%

Minimum 5 values

Value Count Frequency (%)
2094 1 0.1%
2097 1 0.1%
2104 1 0.1%
2112 1 0.1%
2122 1 0.1%

Maximum 5 values

Value Count Frequency (%)
26999 1 0.1%
26997 1 0.1%
26968 1 0.1%
26959 1 0.1%
26956 1 0.1%

NumCompaniesWorked
Real number (ℝ≥0)

ZEROS
Distinct count 10
Unique (%) 0.7%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Mean 2.6931972789115646
Minimum 0
Maximum 9
Zeros 197
Zeros (%) 13.4%
Memory size 11.6 KiB

Quantile statistics

Minimum 0
5-th percentile 0
Q1 1
Median 2
Q3 4
95-th percentile 8
Maximum 9
Range 9
Interquartile range (IQR) 3

Descriptive statistics

Standard deviation 2.498009006
Coefficient of variation (CV) 0.9275254455
Kurtosis 0.01021381669
Mean 2.693197279
Median Absolute Deviation (MAD) 1
Skewness 1.026471112
Sum 3959
Variance 6.240048994

Histogram with fixed size bins (bins=10)

Common values

Value Count Frequency (%)
1 521 35.4%
0 197 13.4%
3 159 10.8%
2 146 9.9%
4 139 9.5%
7 74 5.0%
6 70 4.8%
5 63 4.3%
9 52 3.5%
8 49 3.3%

Minimum 5 values

Value Count Frequency (%)
0 197 13.4%
1 521 35.4%
2 146 9.9%
3 159 10.8%
4 139 9.5%

Maximum 5 values

Value Count Frequency (%)
9 52 3.5%
8 49 3.3%
7 74 5.0%
6 70 4.8%
5 63 4.3%

OverTime
Boolean

Distinct count 2
Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory size 11.6 KiB

Value Count Frequency (%)
0 1054 71.7%
1 416 28.3%

PercentSalaryHike
Real number (ℝ≥0)

Distinct count 15
Unique (%) 1.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Mean 15.209523809523809
Minimum 11
Maximum 25
Zeros 0
Zeros (%) 0.0%
Memory size 11.6 KiB

Quantile statistics

Minimum 11
5-th percentile 11
Q1 12
Median 14
Q3 18
95-th percentile 22
Maximum 25
Range 14
Interquartile range (IQR) 6

Descriptive statistics

Standard deviation 3.659937717
Coefficient of variation (CV) 0.2406346025
Kurtosis -0.3005982221
Mean 15.20952381
Median Absolute Deviation (MAD) 2
Skewness 0.8211279756
Sum 22358
Variance 13.39514409

Histogram with fixed size bins (bins=10)

Common values

Value Count Frequency (%)
11 210 14.3%
13 209 14.2%
14 201 13.7%
12 198 13.5%
15 101 6.9%
18 89 6.1%
17 82 5.6%
16 78 5.3%
19 76 5.2%
22 56 3.8%
Other values (5) 170 11.6%

Minimum 5 values

Value Count Frequency (%)
11 210 14.3%
12 198 13.5%
13 209 14.2%
14 201 13.7%
15 101 6.9%

Maximum 5 values

Value Count Frequency (%)
25 18 1.2%
24 21 1.4%
23 28 1.9%
22 56 3.8%
21 48 3.3%

PerformanceRating
Categorical

Distinct count 2
Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory size 11.6 KiB

Value Count Frequency (%)
3 1244 84.6%
4 226 15.4%

Length

Max length 1
Mean length 1
Min length 1

Value Count Frequency (%)
Decimal_Number 2 100.0%

Value Count Frequency (%)
Common 2 100.0%

Value Count Frequency (%)
ASCII 2 100.0%

RelationshipSatisfaction
Categorical

Distinct count 4
Unique (%) 0.3%
Missing 0
Missing (%) 0.0%
Memory size 11.6 KiB

Value Count Frequency (%)
3 459 31.2%
4 432 29.4%
2 303 20.6%
1 276 18.8%

Length

Max length 1
Mean length 1
Min length 1

Value Count Frequency (%)
Decimal_Number 4 100.0%

Value Count Frequency (%)
Common 4 100.0%

Value Count Frequency (%)
ASCII 4 100.0%

StockOptionLevel
Categorical

Distinct count 4
Unique (%) 0.3%
Missing 0
Missing (%) 0.0%
Memory size 11.6 KiB

Value Count Frequency (%)
0 631 42.9%
1 596 40.5%
2 158 10.7%
3 85 5.8%

Length

Max length 1
Mean length 1
Min length 1

Value Count Frequency (%)
Decimal_Number 4 100.0%

Value Count Frequency (%)
Common 4 100.0%

Value Count Frequency (%)
ASCII 4 100.0%

TotalWorkingYears
Real number (ℝ≥0)

Distinct count 40
Unique (%) 2.7%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Mean 11.279591836734694
Minimum 0
Maximum 40
Zeros 11
Zeros (%) 0.7%
Memory size 11.6 KiB

Quantile statistics

Minimum 0
5-th percentile 1
Q1 6
Median 10
Q3 15
95-th percentile 28
Maximum 40
Range 40
Interquartile range (IQR) 9

Descriptive statistics

Standard deviation 7.780781676
Coefficient of variation (CV) 0.6898105701
Kurtosis 0.9182695366
Mean 11.27959184
Median Absolute Deviation (MAD) 4
Skewness 1.117171853
Sum 16581
Variance 60.54056348

Histogram with fixed size bins (bins=10)

Common values

Value Count Frequency (%)
10 202 13.7%
6 125 8.5%
8 103 7.0%
9 96 6.5%
5 88 6.0%
7 81 5.5%
1 81 5.5%
4 63 4.3%
12 48 3.3%
3 42 2.9%
Other values (30) 541 36.8%

Minimum 5 values

Value Count Frequency (%)
0 11 0.7%
1 81 5.5%
2 31 2.1%
3 42 2.9%
4 63 4.3%

Maximum 5 values

Value Count Frequency (%)
40 2 0.1%
38 1 0.1%
37 4 0.3%
36 6 0.4%
35 3 0.2%

TrainingTimesLastYear
Real number (ℝ≥0)

ZEROS
Distinct count 7
Unique (%) 0.5%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Mean 2.7993197278911564
Minimum 0
Maximum 6
Zeros 54
Zeros (%) 3.7%
Memory size 11.6 KiB

Quantile statistics

Minimum 0
5-th percentile 1
Q1 2
Median 3
Q3 3
95-th percentile 5
Maximum 6
Range 6
Interquartile range (IQR) 1

Descriptive statistics

Standard deviation 1.289270621
Coefficient of variation (CV) 0.4605656896
Kurtosis 0.494992986
Mean 2.799319728
Median Absolute Deviation (MAD) 1
Skewness 0.5531241711
Sum 4115
Variance 1.662218734

Histogram with fixed size bins (bins=10)

Common values

Value Count Frequency (%)
2 547 37.2%
3 491 33.4%
4 123 8.4%
5 119 8.1%
1 71 4.8%
6 65 4.4%
0 54 3.7%

Minimum 5 values

Value Count Frequency (%)
0 54 3.7%
1 71 4.8%
2 547 37.2%
3 491 33.4%
4 123 8.4%

Maximum 5 values

Value Count Frequency (%)
6 65 4.4%
5 119 8.1%
4 123 8.4%
3 491 33.4%
2 547 37.2%

WorkLifeBalance
Categorical

Distinct count 4
Unique (%) 0.3%
Missing 0
Missing (%) 0.0%
Memory size 11.6 KiB

Value Count Frequency (%)
3 893 60.7%
2 344 23.4%
4 153 10.4%
1 80 5.4%

Length

Max length 1
Mean length 1
Min length 1

Value Count Frequency (%)
Decimal_Number 4 100.0%

Value Count Frequency (%)
Common 4 100.0%

Value Count Frequency (%)
ASCII 4 100.0%

YearsAtCompany
Real number (ℝ≥0)

ZEROS
Distinct count 37
Unique (%) 2.5%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Mean 7.0081632653061225
Minimum 0
Maximum 40
Zeros 44
Zeros (%) 3.0%
Memory size 11.6 KiB

Quantile statistics

Minimum 0
5-th percentile 1
Q1 3
Median 5
Q3 9
95-th percentile 20
Maximum 40
Range 40
Interquartile range (IQR) 6

Descriptive statistics

Standard deviation 6.126525152
Coefficient of variation (CV) 0.8741984056
Kurtosis 3.935508756
Mean 7.008163265
Median Absolute Deviation (MAD) 3
Skewness 1.764529454
Sum 10302
Variance 37.53431044

Histogram with fixed size bins (bins=10)

Common values

Value Count Frequency (%)
5 196 13.3%
1 171 11.6%
3 128 8.7%
2 127 8.6%
10 120 8.2%
4 110 7.5%
7 90 6.1%
9 82 5.6%
8 80 5.4%
6 76 5.2%
Other values (27) 290 19.7%

Minimum 5 values

Value Count Frequency (%)
0 44 3.0%
1 171 11.6%
2 127 8.6%
3 128 8.7%
4 110 7.5%

Maximum 5 values

Value Count Frequency (%)
40 1 0.1%
37 1 0.1%
36 2 0.1%
34 1 0.1%
33 5 0.3%

YearsInCurrentRole
Real number (ℝ≥0)

ZEROS
Distinct count 19
Unique (%) 1.3%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Mean 4.229251700680272
Minimum 0
Maximum 18
Zeros 244
Zeros (%) 16.6%
Memory size 11.6 KiB

Quantile statistics

Minimum 0
5-th percentile 0
Q1 2
Median 3
Q3 7
95-th percentile 11
Maximum 18
Range 18
Interquartile range (IQR) 5

Descriptive statistics

Standard deviation 3.623137035
Coefficient of variation (CV) 0.856685128
Kurtosis 0.4774207735
Mean 4.229251701
Median Absolute Deviation (MAD) 3
Skewness 0.9173631563
Sum 6217
Variance 13.12712197

Histogram with fixed size bins (bins=10)

Common values

Value Count Frequency (%)
2 372 25.3%
0 244 16.6%
7 222 15.1%
3 135 9.2%
4 104 7.1%
8 89 6.1%
9 67 4.6%
1 57 3.9%
6 37 2.5%
5 36 2.4%
Other values (9) 107 7.3%

Minimum 5 values

Value Count Frequency (%)
0 244 16.6%
1 57 3.9%
2 372 25.3%
3 135 9.2%
4 104 7.1%

Maximum 5 values

Value Count Frequency (%)
18 2 0.1%
17 4 0.3%
16 7 0.5%
15 8 0.5%
14 11 0.7%

YearsSinceLastPromotion
Real number (ℝ≥0)

ZEROS
Distinct count 16
Unique (%) 1.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Mean 2.1877551020408164
Minimum 0
Maximum 15
Zeros 581
Zeros (%) 39.5%
Memory size 11.6 KiB

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 1
Q3 3
95-th percentile 9
Maximum 15
Range 15
Interquartile range (IQR) 3

Descriptive statistics

Standard deviation 3.222430279
Coefficient of variation (CV) 1.472939213
Kurtosis 3.612673115
Mean 2.187755102
Median Absolute Deviation (MAD) 1
Skewness 1.984289983
Sum 3216
Variance 10.3840569

Histogram with fixed size bins (bins=10)

Common values

Value Count Frequency (%)
0 581 39.5%
1 357 24.3%
2 159 10.8%
7 76 5.2%
4 61 4.1%
3 52 3.5%
5 45 3.1%
6 32 2.2%
11 24 1.6%
8 18 1.2%
Other values (6) 65 4.4%

Minimum 5 values

Value Count Frequency (%)
0 581 39.5%
1 357 24.3%
2 159 10.8%
3 52 3.5%
4 61 4.1%

Maximum 5 values

Value Count Frequency (%)
15 13 0.9%
14 9 0.6%
13 10 0.7%
12 10 0.7%
11 24 1.6%

YearsWithCurrManager
Real number (ℝ≥0)

ZEROS
Distinct count 18
Unique (%) 1.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Mean 4.12312925170068
Minimum 0
Maximum 17
Zeros 263
Zeros (%) 17.9%
Memory size 11.6 KiB

Quantile statistics

Minimum 0
5-th percentile 0
Q1 2
Median 3
Q3 7
95-th percentile 10
Maximum 17
Range 17
Interquartile range (IQR) 5

Descriptive statistics

Standard deviation 3.568136121
Coefficient of variation (CV) 0.8653951654
Kurtosis 0.1710580839
Mean 4.123129252
Median Absolute Deviation (MAD) 3
Skewness 0.833450992
Sum 6061
Variance 12.73159537

Histogram with fixed size bins (bins=10)

Common values

Value Count Frequency (%)
2 344 23.4%
0 263 17.9%
7 216 14.7%
3 142 9.7%
8 107 7.3%
4 98 6.7%
1 76 5.2%
9 64 4.4%
5 31 2.1%
6 29 2.0%
Other values (8) 100 6.8%

Minimum 5 values

Value Count Frequency (%)
0 263 17.9%
1 76 5.2%
2 344 23.4%
3 142 9.7%
4 98 6.7%

Maximum 5 values

Value Count Frequency (%)
17 7 0.5%
16 2 0.1%
15 5 0.3%
14 5 0.3%
13 14 1.0%

Interactions

Correlations

Pearson's r

Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, so for an exactly linear relationship the steepness of the line does not affect r; only the sign of its slope does.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
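
The calculation described above can be sketched in a few lines of plain Python. This is a minimal illustration on hypothetical toy data, not the code pandas-profiling runs; the function name `pearson_r` is an assumption made for this example.

```python
import math


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    # The 1/n factors of the covariance and of the two standard deviations
    # cancel, so plain sums of products are sufficient.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(ss_x * ss_y)


# Hypothetical toy data, not taken from the profiled dataset.
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]
r = pearson_r(x, y)

# Shifting x and rescaling it by a positive factor leaves r unchanged.
x_rescaled = [3 * v + 10 for v in x]
r_rescaled = pearson_r(x_rescaled, y)
print(round(r, 4), round(r_rescaled, 4))
```

The second call illustrates the invariance under changes of location and scale mentioned above.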

Spearman's ρ

Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
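
As a rough sketch of that recipe, the code below ranks both variables (ties receive the average of their positions, a common convention assumed here) and then applies the Pearson formula to the ranks. The helper names and the toy data are hypothetical.

```python
import math


def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def pearson_r(x, y):
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(ss_x * ss_y)


def spearman_rho(x, y):
    """Pearson correlation of the rank variables."""
    return pearson_r(average_ranks(x), average_ranks(y))


# Hypothetical toy data: y grows monotonically but not linearly with x.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]
rho = spearman_rho(x, y)
r = pearson_r(x, y)
print(round(rho, 4), round(r, 4))
```

On this data ρ reaches 1 because the relationship is perfectly monotonic, while r stays below 1 because it is not linear.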

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
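
The pair-counting definition above corresponds to the variant usually called tau-a. A minimal sketch on hypothetical toy data follows; the function name is an assumption, and library implementations often report a tie-adjusted variant (tau-b) instead, so results on data with ties may differ.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs."""
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1      # both variables move the same way
        elif direction < 0:
            discordant += 1      # the variables move in opposite ways
        # Pairs tied in x or y count as neither.
    n = len(x)
    return (concordant - discordant) / (n * (n - 1) / 2)


# Hypothetical toy data with one swapped pair out of six.
x = [1, 2, 3, 4]
y = [1, 3, 2, 4]
tau = kendall_tau_a(x, y)
print(round(tau, 4))
```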

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available for Phik.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
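
A sketch of a bias-corrected Cramér's V is given below, following the correction commonly attributed to Bergsma (2013): the chi-squared-based φ² and the table dimensions are each adjusted before taking the ratio. This is an illustration under that assumption, not the exact code used to produce this report; the function name and the contingency tables are hypothetical, and every row and column total is assumed to be non-zero.

```python
import math


def cramers_v_corrected(table):
    """Bias-corrected Cramér's V for a contingency table given as a list of rows."""
    n = sum(sum(row) for row in table)
    r, k = len(table), len(table[0])
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]

    # Pearson chi-squared statistic against the independence model.
    chi2 = 0.0
    for i in range(r):
        for j in range(k):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (table[i][j] - expected) ** 2 / expected

    phi2 = chi2 / n
    # Bias correction of phi squared and of the table dimensions.
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    return math.sqrt(phi2_corr / min(k_corr - 1, r_corr - 1))


# Hypothetical tables: one with independent margins, one perfectly associated.
v_independent = cramers_v_corrected([[10, 20], [20, 40]])
v_perfect = cramers_v_corrected([[50, 0], [0, 50]])
print(v_independent, round(v_perfect, 4))
```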

Missing values

Sample

First rows

Age Attrition BusinessTravel DailyRate Department DistanceFromHome Education EducationField EmployeeNumber EnvironmentSatisfaction Gender HourlyRate JobInvolvement JobLevel JobRole JobSatisfaction MaritalStatus MonthlyIncome MonthlyRate NumCompaniesWorked OverTime PercentSalaryHike PerformanceRating RelationshipSatisfaction StockOptionLevel TotalWorkingYears TrainingTimesLastYear WorkLifeBalance YearsAtCompany YearsInCurrentRole YearsSinceLastPromotion YearsWithCurrManager
0411211021125120943284159931947981113108016405
14901279281523161227225130249071023441103310717
237121373222144192216312090239661153207330000
3330113922345540563173229092315911113308338730
427025912214711403162234681663290123416332222
5320110052225841793164130681186400133308227736
65902132423341030814161226709964412041312321000
730021358224151141673163026931333510224211231000
83801216223351241442353195268787002142010239718
9360212992273413319432432523716577601332217327777

Last rows

Age Attrition BusinessTravel DailyRate Department DistanceFromHome Education EducationField EmployeeNumber EnvironmentSatisfaction Gender HourlyRate JobInvolvement JobLevel JobRole JobSatisfaction MaritalStatus MonthlyIncome MonthlyRate NumCompaniesWorked OverTime PercentSalaryHike PerformanceRating RelationshipSatisfaction StockOptionLevel TotalWorkingYears TrainingTimesLastYear WorkLifeBalance YearsAtCompany YearsInCurrentRole YearsSinceLastPromotion YearsWithCurrManager
146029024682284420544073217113785848910143205315404
146150124101283320554139238101085416586411332120333220
146239027221241320562060248421203188280011311212220996
146331003252534205721743251199363787001932010239417
1464260211671531206040302123129662137800183405234200
14653601884223242061314142642257112290401733117335203
146639026132614206241422341299912145740153119537717
14672702155243520642187425226142517411204216036203
14684901102312342065416322822539013243201434017329608
146934026282834206821824263244041022820123106344312